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A method for the simultaneous determination of the energy scales for fe-quark jets and light jets, 
the jet energy resolution, and the top quark mass at hadron colliders is presented. The method 
exploits the unique kinematics of events with top-antitop pair production, where one of the top 
quarks involves a leptonic and one a hadronic W boson decay. The paper shows a feasibility study 
of how this simultaneous measurement can be performed at the upcoming LHC experiments ATLAS 
and CMS. 
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Precise knowledge of the energy scale for 6-quark jet reconstruction will be crucial for many measurements at the 
ATLAS and CMS experiments at the upcoming LHC collider at CERN, including decays of the top quark, Higgs 
boson, and supersymmetric particles. The determination of the absolute energy scale for calorimeter jets has however 
— — l' proved to be a challenging task for the CDF and DO experiments at the Fermilab Tevatron collider. For light jets, 
this scale can be determined from collider data using events containing an isolated energetic photon balanced by a 
I ' jet In contrast to the calibration of charged-lepton reconstruction, massive resonances decaying to a jet-jet final 
i' state cannot readily be used for jet energy calibration; for example, the reconstruction of Z — > bb decays is difficult 
because of the large backgrounds. The only exception is the reconstruction of hadronic W decays in events containing 
a top-antitop pair, where one of the top quark decays (charge conjugate modes are implicitly included throughout 
this paper) involves a hadronic, and the other a leptonic W decay ("£+jets tt events"). Based on such events, both 
CDF and DO have performed simultaneous measurements of the top quark mass and the calorimeter jet energy scale 
Sj for light-quark jets 0]. No separate determination of the 6-quark jet energy scale is available so far, and in these 
measurements, the uncertainty on the relative difference between the energy scales for light-quark and &-quark jets 
leads to one of the largest remaining systematic uncertainties on the top quark mass. 

In this paper, a novel measurement technique is presented that allows a simultaneous determination of the top quark 
mass mt, light-quark jet energy scale Sj, relative light- to 6-quark jet energy scale Sb, and the jet energy resolution 
R from £+jets tt events recorded at a hadron collider. The energy scale Sj is defined like for example in i.e. a 
scale larger than 1.0 corresponds to a calorimeter response larger than unity, which means that the reconstructed 
jet energy is too large on average. The factor Sj is applied to all jets, and an additional factor Sb is introduced to 
• • ■ describe differences between light-quark and 6-quark jets. The method to determine all four parameters relies on 
the fact that the dependence on Sb is in general different for the two reconstructed invariant top quark masses per 
event. This is because the transverse momentum (relative to the beam) of the neutrino from the leptonic W decay 
is reconstructed by imposing transverse momentum balance of the event, which means that it depends on Sb if the 
magnitude of the vector sum of the two &-quark transverse momenta is large, while there is no such dependence 
for the hadronic W decay. Events with small vector sum of the two 6-quark transverse momenta cannot contribute 
independent information on nit and Sb- 

In the following it is assumed that calorimeter noise subtraction and the full calorimeter calibration up to a constant 
scale (which may be different for light and 6-quark jets) have been performed before the technique described here is 
applied. Noise studies can be performed with events recorded with minimal trigger requirements; the pre-calibration 
can be based on the transverse momentum balance in dijet events. In broad terms, the method to determine the 
top quark mass and absolute jet energy scales then comprises the following steps: A sample of candidate f+jets tt 
events that contain independent information on m± and Sb is selected. To measure the four parameters mt, Sj, Sb, 
and R, estimators are calculated for each selected event. The number of estimators does not have to be equal to the 
number of parameters to be determined; in the analysis described here, the four parameters are measured using three 
estimators m r t cc °, Sj cc °, and Sl eco which are chosen so that they are related with mt, Sj, and Sb- Functions are derived 
to describe the expected estimator distributions (templates) for any given set of assumed values of the parameters 
mt, Sj, Sb, and R. The measured estimator distributions in the data are then compared to these fitted templates, 
and the m t , Sj, Sb, and R values and their uncertainties and correlations are determined in a fit. To validate the 
procedure, the analysis is performed on simulated pseudo- experiments. To be precise, since the method is based on a 
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fit to simulated templates, it determines the ratio of energy scales in data and simulated events. 

The method should be applicable to both the LHC and the Tevatron experiments; as a concrete example, the 
experimental situation at the LHC (proton-proton collisions at a center-of-mass energy of 14 TeV) is considered in the 
following. Signal tt £+jets events are generated at parton level and smeared according to a parameterized detector 
resolution. No gluon radiation and no backgrounds are included, and it is assumed that the two 6-quark jets are 
identified unambiguously, i.e., the efficiency e& for b- quark jet identification may be smaller than unity, but no light 
jets are misidentified. The aim of this paper is thus to establish the feasibility of the calibration technique for such an 
idealized case; conditions that are closer to reality and systematic uncertainties will be addressed in a later publication. 
In general, it cannot be assumed that the 6-quark jet energy scale measurement is unbiased by the restriction to those 
6-quark jets that are identified. However, this is not a drawback as most applications, i.e. measurements for which 
a 6-quark jet energy scale determination is needed, will also use identified b jets. Residual biases due to differences 
in event topologies between top-antitop events and other event classes will have to be studied with simulated events, 
but can be considered a second-order effect. 

The paper is organized as follows: Section HT1 contains a general discussion of the kinematic reconstruction of £+jets 
tt events used in the measurement technique presented here. The generation of simulated events and the event 
selection are described in Section HTT1 and results of a toy-Monte Carlo study of the reconstruction method, based on 
realistic parameters for the resolution of the LHC detectors, are discussed in Section HVl Section IV1 summarizes the 
findings and gives an outlook and conclusion. 



II. KINEMATIC EVENT RECONSTRUCTION 



This section describes the event- by- event computation of the estimators mj oco , S v ° co , and S , ^ cco . It should be noted 
that these calculations can be performed for any £+jets tt candidate event and that the validity of this section is 
not restricted to the simplifying assumptions made in the generation of events used later in this paper. While the 
information in templates obtained with more realistic conditions may be somewhat diluted, the general arguments 
made in this section will still hold. 

The signature of a £+jets tt event in the detector is the presence of four energetic jets (two 6-quark jets from the top 
and antitop, and two light-quark jets from the hadronic W decay), an isolated energetic charged lepton (only electrons 
and muons are considered here, as r leptons decay close to the interaction point leading to additional complications), 
and missing transverse momentum (with respect to a balanced event) due to the undetected neutrino. There may be 
additional jets from gluon radiation, however such jets often have lower transverse energy than the tt decay products. 
Such jets are not considered here, but a minimum jet transverse energy is always required in the experimental event 
selection (such a cut is also included in the study described in Section InTf . 

When the masses of the six tt decay products are assumed to be known, there are 18 unknowns from the 6 final- 
state particle 3-momenta per event, and in addition 3 unknowns corresponding to the estimators mJ eco , S^ oc °, and 
S™ 00 to be calculated on an event-by-event basis. There are 17 measurements in each event corresponding to the five 
3-momenta of the four jets and the charged lepton and to the two components of the missing transverse momentum. 
(In fact, the missing transverse momentum is a derived quantity, calculated from the vector sum of the momenta 
of all other final-state particles and other "unclustered energy" in the event that is not assigned to final-state jets. 
By imposing transverse momentum balance, the neutrino transverse momentum can be identified with the missing 
transverse momentum, but its reconstructed value is not independent of the reconstructed jet and charged-lepton 
momenta.) In addition to the above 17 measurements, 4 constraints can be applied in each event, corresponding 
to the invariant masses of the two W bosons and two top quarks. Given that there are as many measurements as 
unknowns, the event kinematics can be solved and the three estimators m\ eco ', Sj cco , and Sl ec ° calculated. 

The correct assignment of jets to final-state quarks is not known a priori, but it is assumed that the two light-quark 
jets can be assigned unambiguously to the hadronic W decay. The known mass mw of the hadronically decaying W 
boson yields information on the energy scale Sj for light-quark jets. It is assumed that the scale factor is the same 
for both jets, such that the estimator Sj eco is calculated as 

raw 

^reco = _W_ (!) 

J mw 

where m^ w is the mass obtained from the reconstructed jet energies and momenta. For the following discussion, the 
measured energies and momenta of all jets are divided by the estimator Sj cco obtained in the same event, and the 
reconstructed missing transverse momentum is adjusted accordingly. 

There are two possibilities for combining the two light-quark jets with one of the 6-quark jets to reconstruct a 
top-quark decay. To proceed, one can either select one combination (as done in Section iHlj) , or consider both of them, 
possibly together with a suitable relative weight. For a given combination, a scan over 5™°° values is performed. 
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Given an assumed value of Sl eco , the reconstructed 6-quark jet energies and momenta are scaled accordingly, and 
the missing transverse momentum is adjusted and taken as transverse momentum of the neutrino from the leptonic 
W decay. The longitudinal neutrino momentum p* is then obtained from the known mass raw of the leptonically 
decaying W as 

m 2 w = 2E e y/ (pTf + (pj ) 2 - 2 {pj ■ p T v + pfpl) 
E e 2 -pf 

If the solutions are real, one obtains one value for the reconstructed mass HiJ e j° p of the top quark with the leptonic 
W decay for each p z v solution. The reconstructed mass wj c jj° d of the top quark with the hadronic W decay is also 
computed for the given S& value. If one finds m Y t e ^° p = m ( C had' then tms *°P <l uar k mass and the corresponding S^ 6 ™ 
value can be taken as estimator values for the event. There are up to four solutions per event: two assignments of 
6-quark jets and two solutions to Equation IJ5J. 
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III. SIMULATION AND EVENT SELECTION 



The kinematic event reconstruction and the measurement of the four quantities m t , Sj, Sb, and R have been tested 
with the experimental resolution of the ATLAS and CMS experiments in mind. The ALPGEN [1] program has been 
used to generate i'+jets tt events in proton-proton collisions at a center-of-mass energy of 14TeV for true top quark 
masses between 160 and 190 GeV in steps of 5 GeV. To simulate the effect of jet reconstruction in the experiment, the 
energies of the final-state quarks have been smeared according to a Gaussian resolution whose width a(E) is set to 

a(E) = R^fE (3) 

with values of R between 0.8 and 1.2 V GeV varied in steps of 0.1 vGeV (this corresponds to a resolution <r(E)/E 
between 80%/-/E and 120%/VE with E in units of GeV). The jet energy resolution of the ATLAS and CMS 
experiments is expected to be of the same order of magnitude [jj. Finally, all jet energies and momenta are multiplied 
by a factor Sj between 0.8 and 1.2 in steps of 0.1, and independently, another multiplicative scale factor of Sb between 
0.8 and 1.2 in steps of 0.1 is applied to 6-quark jet energies and momenta. 
A preselection of £+jets tt candidate events is first applied: 

(PI) A minimum charged-lepton transverse energy of 20 GeV is required. 

(P2) The pseudorapidity rj of the charged lepton and all jets must be between —2.5 and 2.5. 

(P3) The angular separation A1Z = \J (Ar/) 2 + (A0) 2 between the charged lepton and any jet must be larger than 

0.4, and the same must be true for any jet-jet pair. 
(P4) After smearing and energy scaling, the transverse energies of all jets must be larger than 30 GeV. 

The estimator 5*™°° is calculated from the reconstructed mass of the hadronically decaying W as in Equation (JTJ) , and 
all reconstructed jet energies and momenta are scaled by l/S^ cco . Events in which the jet assignment is unambiguous 
and that contain independent information on the top quark mass and 6-quark jet energy scale are then selected by 
these criteria: 

(51) To unambiguously assign the two 6-quark jets, it is required that exactly one of the 6-quark jets have an 
invariant mass together with the light-quark jet pair between 150 and 200 GeV. In the following, this 6-quark jet 
is assumed to come from the decay of the top quark with the hadronically decaying W. For a sample generated 
with m t = 175 GeV, Sj = Sb = 1, and R = lvGcV, 74% of the events that pass the preselection fulfill this 
criterion. 

(52) As explained in Section HI events with small magnitude of the vector sum of 6-quark jet transverse momenta 
cannot yield independent information on the top quark mass and 6-quark jet energy scale. Therefore, this 
magnitude is required to be larger than 50 GeV. 80% of the events that pass cut (SI) are retained. 

(53) Events are only retained if exactly one solution to Equation ([2]) is found with 0.5 < Sl° co < 2.0 and 150 GeV < 
TO reco ^ 200 GeV. Thus a choice between several possible estimator values is avoided. Of the events that pass 
criterion (S2), 38%, 45%, and 17% have 0, 1, and 2 such solutions, respectively. More than 2 solutions per event 
do not occur. Thus, 45% of the events that pass selection criterion (S2) also pass this cut. 
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FIG. 1: For simulated events with m t = 175 GeV, Sj = St, = 1, and R = l\/GeV, the distribution of the quantity Af rcco for 
events passing selection cut (S3). The selection applied in (S4) is indicated. The dip at M rcco = is due to the fact that such 
events cannot yield a solution for m T t °i° p = m I t c '^ d . 



(S4) Finally, the quantity M reco := ^^^^ is obtained during the scan of S r b cco values and taken as a 
measure of whether an event contains independent information on the top quark mass and 6-quark jet energy 
scale. The M TCCO distribution in all events that pass cut (S3) is shown in Figured] In events with small |Af rcco | 
the value of Sl ec ° has a large uncertainty, and thus events with |M reco | < 30 GeV are rejected. The m\ eco and 
grcco templates for events with M roco < — 30 GeV have a degraded resolution compared with the distributions 
for M reco > +30 GeV, and only the latter events are retained. This criterion thus selects events in which the 
value of m\ c l° v decreases much less rapidly than toJj^ or even increases with Sl° co . A fraction of 11% of events 
passes this final cut, yielding an overall efficiency of 2.9% after the preselection. 

The effect of criteria (S2) and (S4) is illustrated in Figure^ Plot (a) shows the m\ ec ° vs. S r b aco estimator distribution 
for a sample generated with m t — 175 GeV, Sj = Sb = 1, and R = lvGcV when all selection cuts except (S2) and (S4) 
are applied; plot (b) shows events that pass the full event selection. It is evident how the m\ cco and 5™ co resolution 
is superior in those events that pass all selection cuts; the correlation is reduced from —0.76 to —0.62. Plot (c) shows 
the m\ ec ° vs. Sl° c ° estimator distributions for events passing the full selection for various input values; it can be seen 
that independent information on mt and Sb is contained in the events. 

It should be noted that the present analysis is intended as a conceptual study and a proof of principle. Consequently, 
no optimisation of the cut values in the individual selection criteria has been performed. 

Estimator distributions for various choices of input parameters are shown in Figure [3J This figure shows clearly the 
sensitivity to a variation of the input parameters. 

IV. TEMPLATE FIT 

Estimator distributions have been determined from the simulated events described above for various m t , Sj, Sb, and 
R input values. To become insensitive to statistical fluctuations of these templates, functions T e have been derived 
to describe each of the three estimator distributions, e = m\ cc °, Sj CCO , Sl° co . The m\ BCO templates are described with 
a normalized double Gaussian, and normalized single Gaussian functions are used to describe the S™ co and Sl ec ° 
templates. The parameters that describe these functions are themselves taken to be linear functions of the mt, Sj, Sb, 
and R input values. The fact that this simple parameterization cannot describe every detailed aspect of the templates 
is no limitation for this conceptual study. It will lead to small deviations between fitted and input values, see below; 
however, the general conclusion that all four quantities can be measured independently of each other remains valid. 

Pseudo-experiments are then performed using simulated events as pseudo-data. A binned likelihood fit is performed 
to determine mt, Sj, Sb, and R in each pseudo-experiment, with the likelihood £ as a function of m t , Sj, Sb, and R 
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FIG. 2: For simulated events with m t = 175 GeV, Sj = Sb = 1, and i? = l v / GeV, the two-dimensional mJ GCO vs. S^ 600 estimator 
distribution is shown if (a) cuts (S2) and (S4) are not applied and (b) the full event selection is applied. In (c), two-dimensional 
estimator distributions are shown with separate color codes for (rot, Sj,) input values of (185 GeV, 0.8): green, (185 GeV, 1.2): 
magenta, (165 GeV, 0.8): red, and (165 GeV, 1.2): blue. The input values of Sj = 1 and R = lVGeV have been kept. Here, 
only those bins are shown that contain at least 2% of the events. The reconstructed estimator distributions cluster around the 
input mt and Sb values. In all plots, the size of the squares is proportional to the number of entries. 



hypotheses given by 

nbinu(e) 

- lxi£ (m t , Sj, St, R) = - } } D e (i) lnT e (m t , Sj, Sb, R; i) ■ (4) 

e — m\ cco . i—1 
greco greco 
J ' b 

The first sum is over the three estimator quantities, generically denoted by e, and the second sum runs over the bins 
i of each estimator distribution. The number of pseudo-data events in bin i of estimator distribution e is denoted by 
D e (i), while T e (mt, Sj, Sb, R', i) is the value of the function used to parameterize the template, evaluated at the center 
of bin i under the assumption of given m t , Sj, Sb, and R values. In each pseudo-experiment, the four-dimensional 
— ln£ space is scanned, and the values of mt, Sj, Sb, and R that minimize — ln£ are taken as measured values. 

For various sets of input parameter values, 1000 pseudo-experiments with 300 events each have been evaluated. For 
m t = 175 GeV, this corresponds to an integrated luminosity of 100pb -1 /e£ at the LHC, where e& is the efficiency of 
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FIG. 3: Estimator distributions for simulated events with different input parameters. All distributions are normalised. Param- 
eter values used in the event generation are m t = 175 GeV, Sj = Sb = 1, and R = l\/GeV except as noted in the plots. Plot 
(a) shows how the rn\ cc ° template depends on the input mt value, plot (b) shows the dependence on the R value. Similar plots 
are shown in (c) and (d) for the dependence of the Sj CC ° distribution on the input Sj and R values. In (e), the dependence of 
the Sl cc ° template on the Sb input is visualized, while plot (f) shows that there is no large dependence of the SI' 
on the ra t input value. The functions used to parameterize the templates as described in Section HVl are overlaid. 
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identifying a &-quark jet. For a given set of inputs, the distributions of measurement values have been fitted with a 
Gaussian to determine the expected measurement value and its statistical uncertainty, as shown in Figure [U With 
an integrated luminosity of 100pb _1 /e^, the LHC experiments will be able to make measurements with the following 
statistical uncertainties: 

Am t (stat.) = 500 MeV (5) 
AS^stat.) = 0.004 
AS&(stat.) = 0.01 
Ai?(stat.) = 0.05 VGeV. 

The information from the pseudo-experiments is then used to obtain the calibration curves in Figure [5l which show 
that all four input parameters can be measured independently, i.e. the mean measurement value has a large slope 
with respect to the input value of the same parameter but no dependence on the values of the other parameters. The 
correlation matrix between the four measured parameters is given in Table [T] 

The bias on the top quark mass and jet energy resolution is not unexpected and is due to the approximative 
description of the templates by the fitted functions. Such effects could be alleviated by using more complicated 
functions, but that would go beyond the scope of this conceptual study, and the exact template shapes will change 
anyway when background and full detector simulation are included. In particular, it should be noted that a correction 
for biases of the raw fit results based on similar calibration curves will always be necessary in a measurement. The bias 
itself does not necessarily correspond to a systematic uncertainty; rather, systematic errors (due e.g. to uncertainties 
in the simulation of events) can be determined by studying the differences between calibration curves obtained with 
different simulation parameters. 
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TABLE I: Correlations between the four measured parameters for input values of m t = 175 GeV, Sj = Sb = 1, and R = 1 V GeV. 



V. CONCLUSION AND OUTLOOK 



While it is known how the absolute energy scale for light-quark jets can be determined using events taken at hadron 
collider experiments (using either photon+jet events, balancing the photon and jet in the transverse plane, or £+jets 
ti events, exploiting the mass constraint from the hadronically decaying W boson), independent information on the 
energy scale for 6-quark jets has so far not been obtained. On the other hand, uncertainties on the relative difference 
between the energy scales for light- and 6-quark jets lead to one of the major systematic uncertainties on measurements 
of the top quark mass at the Tevatron experiments. In this paper, it is shown how ^+jets ti events can be used to 
determine the top quark mass m t , light-quark jet energy scale Sj, 6-quark jet energy scale Sb, and a jet resolution 
parameter R independently from each other. The kinematic reconstruction of estimators on an event-by-event basis 
is discussed, and results of pseudo-experiments with conditions similar to those at the LHC experiments ATLAS and 
CMS are shown. The technique presented will enable the LHC experiments to measure the top quark mass while at 
the same time obtaining information on the energy scale for 6-quark jets. This is crucial for measurements of decays 
of the Higgs boson or of supersymmetric particles. As increasingly large datasets are accumulated by the Tevatron 
experiments, the technique may also be used by the CDF and DO experiments to reduce systematic errors on the top 
quark mass measurement. 

The study presented here is to be viewed as a proof of principle. In the future, the technique will be applied to 
fully simulated ATLAS events, including background events. Systematic studies need to be performed to assess how 
much the results depend for example on additional gluon radiation off the initial- or final-state partons. Finally, with 
the large event samples expected at the LHC experiments in mind, the method has been implemented so far based 
on a template fit. The determination of the &-quark jet energy scale can also be included naturally in the Matrix 
Element method used at the Tevatron for top quark mass measurements, which maximizes the statistical sensitivity 
(but also needs much longer computation time per selected event). Any of these studies would go beyond the scope 
of this paper but will be addressed in future publications. 
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FIG. 4: Distributions of fit results for (a) mt, (b) 5*^, (c) Sb, and (d) R for pseudo-experiments with 300 events each, generated 
with parameter values of m t = 175 GeV, Sj = Sb = 1, and R = IV GeV. The correlations between parameters are taken into 
account. All distributions are fitted with a Gaussian; the resulting mean values and resolutions are given in the plots. The 
correlation between mt and Sb is shown in (e), and that between Sj and Sb in (f). 
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FIG. 5: Calibration curves for the determination of (a) mt, (b) Sj, (c) Sb, and (d) R. For each parameter, calibration curves 
are shown for various input values for the other three parameters (left, middle, and right columns). Each circle or triangle 
shows the mean result of pseudo-experiments for a given set of input values. The black circles are identical in all three plots 
in a row, and the line shows the result of a linear fit to these points. The red downward and green upward triangles show the 
results of pseudo-experiments when one of the input parameters is varied as specified in each figure. A slight horizontal offset 
has been applied to the triangles to render them visible. Parameters whose values are not noted explicitly in a plot are set to 
the default values of m t = 175 GeV, Sj = Sb = 1, and R = lyGeV. 
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